Expressing Speaker's Intentions through Sentence-Final Intonations for Japanese Conversational Speech Synthesis
نویسندگان
چکیده
In this study, we investigated speaker’s intentions that the listeners perceive through subtly different sentence-final intonations. Approximately 2,000 sentence utterances were recorded and the fundamental frequency (F0) contours at the last vowel of those sentences were classified through one of the standard clustering algorithms. There found various F0 contours, namely, not only simple rising and falling intonations but also rise-fall and fall-rise intonations. In order to reveal the relationship between the intonation and the intentions, 10 representative contours were selected on the basis of the results of the clustering. Using the selected contours, a subjective evaluation was conducted. Six Japanese sentences that could have different meanings according to the sentence-final intonations were synthesized and the F0 contour at the last vowel of each sentence was replaced with the contours. The results of the evaluation by nine listeners showed that, for example, a certain falling intonation could express the intention of the “conviction” and another one that slightly differ in the shape could convey “doubt.” It was found that the subtle difference in the sentence-final F0 shape conveyed various nuances and connotations.
منابع مشابه
Expression of speaker's intentions through sentence-final particle/ intonation combinations in Japanese conversational speech synthesis
Aiming to provide the synthetic speech with the ability to express speaker’s intentions and subtle nuances, we investigated the relationship between the speaker’s intentions that the listener perceived and sentence-final particle/intonation combinations in Japanese conversational speech. First, we classified F0 contours of sentence-final syllables in actual speech and found various distinctive ...
متن کاملPhonetic investigation of boundary pitch movements in Japanese
Pitch movements at the boundaries of sentence-medial and final phrases in Japanese can provide a cue to the speaker's intention. For example, the phrase /Nagano-de/ ' in Nagano' can be uttered with different rising and/or falling pitch movements on the the final mora /de/ to convey meanings such as clarification, incredulity, prominence in the discourse, insistence, etc. The identification of t...
متن کاملSpeech Synthesis with Attitude
F0 characteristics were analyzed and modeled for the output of speech with natural prosody in communication systems. Lexicons were selected to express speaker's attitude during the human speech generation process. We modeled the prosody using information of constituent lexicons expressing attitude and markedness. Motivated by preliminary observations of prosodic variations in conversational spe...
متن کاملA proposal of a model to extract Japanese voluntary speech rate control
To extract elements of prosodic features which relate to speakers' intentional control is required for speech information processing. Speech rate variation should be a "caution signal" to call listeners' attention strongly. To express and detect such "caution signals", we have proposed a new speech rate model. This model introduces two kinds of force to control the speech rate. One is a driving...
متن کاملLexical tone production by Cantonese speakers with parkinson's disease
The aim of this study was to investigate lexical tone production in Cantonese speakers associated with Parkinson’s disease (PD speakers). The effect of intonation on the production of lexical tone was also examined. Speech data was collected from five Cantonese PD speakers. Speech materials consisted of targets contrasting in tones, embedded in different sentence contexts (initial, medial and f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012